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ABSTRACT 

At the present time, early-type brightest cluster galaxies in the SDSS MaxBCG and C4 
catalogs have larger sizes than early-type galaxies of similar luminosity, whether these 
other objects are in the field, or are satellites in clusters. BCG sizes are also stronger 
functions of luminosity Re oc L than are the sizes of the bulk of the population; this 
remains true if one restricts attention to narrow bins in velocity dispersion. At fixed 
stellar mass and formation time, objects at lower redshift are larger and have smaller 
velocity dispersions - i.e. the sizes increase and velocity dispersions decrease with age. 
In addition, at any given redshift, younger BCGs have slightly larger sizes than older 
BCGs of the same stellar mass; however, they have similar velocity dispersions. As a 
result, at redshifts ~ 0.25, corresponding to lookback times of order 3 Gyrs, BCGs are 
smaller than their lower redshift counterparts by as much as ~ 70% for the brightest 
BCGs: the sizes evolve as (l-(-2;)0-85(Afr-i-2i)^ Qualitatively similar but weaker evolution 
in the sizes is also seen in the bulk of the early-type population: at < —22 the 
sizes evolve as (1 -I- z)^-'^i'^'^r+2i) ^ while at Mr > —22 the evolution is approximately 
(1 -|- independent of Mr- The velocity dispersion-luminosity correlation also 

evolves: (1 + ;j)-0-2(m,+21) ^ (as for the BCGs) and (1 + z)°-^ for fainter 

galaxies. The size- and velocity dispersion-stellar mass correlations yield consistent 
results, although, in this case, accounting for selection effects is less straightforward. 
These trends, in particular the fact that the velocity dispersions at fixed stellar mass 
decrease with age, are most easily understood if early-type BCGs grew from many dry 
minor mergers rather than a few major mergers. Only in such a scenario can BCGs be 
the descendents of the super-dense galaxies seen at 2 ~ 2; major dry mergers, which 
increase the size in proportion to the mass, cannot bring these galaxies onto the BCG 
Re — M* relation at z 0. 

We also compared the ages and sizes of our early-type BCGs with other cluster 
galaxies (satellites). BCGs are larger than satellites of similar luminosity or stellar 
mass at the same redshift. Although both satellites and BCGs trace the same weak 
age— 1/ or age— relation, this can be understood by noting that BCGs are typically 
about 1 Gyr older than the satellites in their group, and they are about 0.5 mags 
more luminous. Finally, we find that the mean satellite luminosity is approximately 
independent of BCG luminosity, in agreement with recent predictions based on the 
luminosity-dependence of clustering. 

Key words: galaxies: formation - galaxies: haloes - dark matter - large scale structure 
of the universe 



There has been recent interest in the sizes of galaxies: at 
fixed stellar mass, galaxies appear to be more than three 
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1 INTRODUCTION 



times smaller at z ~ 2 than at 2: ~ (Trujillo et al. 2006; 
Cimatti et al. 2008; Van Dokkum et al. 2008; Younger et al. 
2008; Buitrago et al. 2008; Tacconi et al. 2008; Chapman et 
al. 2008; Franx et al. 2008; Damen et al. 2008; Damjanov 
et al. 2009; Saracco et al. 2008). Similar evolution in the 
size-luminosity relation of radio galaxies was seen almost a 
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decade ago l|Roche et al. 1998)) . This evolution is difficult to 
arrange in models where the galaxies form from a simple 
monolithic collapse. 

On the other hand, qualitatively similar behaviour for 
dark matter halos has been expected for some thirty years: 
a virialized halo at a given epoch is approximately 200 times 
denser than the critical density at that epoch, whatever its 
mass l|Gunn fc Gott 197"2]| . Thus, at fixed mass, the virial 
radius scales approximately as (1 + z)~^. Mass-independent 
densities appear to be a good description of cluster-mass 
halos locally l|Abbas fc Sheth 20071 IJohnston et al. 20081 
IVikhlinin et al. 2008 P and at higher redshifts (e.g., Meneux 
et al. 2008). These halos are expected to have formed from 
essentially dissipationless mergers, so, for systems domi- 
nated by dissipationless merging, we expect that, at fixed 
mass, the radii are larger at late times. 

One arrives at the same conclusion if one considers 
mergers along parabolic orbits (|Ostriker fc Hausman 19771 
ICiotti 2008p . For example, the velocity dispersion of the 
product of a parabolic merger of two equal mass galaxies 
is the same as that of its progenitors (this assumes mass 
and energy conservation). The virial theorem requires that 
if the mass has doubled with no change to the velocity dis- 
persion, then the size must also have doubled, making the 
density smaller by a factor of four. This is the most extreme 
case: if the progenitor masses were unequal, then the density 
of the product is less than that of the more massive progen- 
itor, but by a factor of less than four. Again, dissipationless 
mergers act to decrease the density. 

Galaxy formation was not dissipationless 
(|Fall fc Efstathiou 19801 [Barnes fc Hernquist 199T| ): gas 
dissipation has played an important role, although this 
is expected to have been more true at high redshift 
( [Hopkins et al. 2008p . Subsequent major mergers between 
disk galaxies are thought to be the main way in which 
elliptical galaxies form l|Toomre fc Toomre 1972} . possibly 
followed by dry dissipationless mergers in which both 
the size and the stellar mass of the final object increase 
(Malumuth fc Kirshner 19851 [Robertson ot al. 2006: 

Boylan-Kolchin et al. 2006| [Kh ochfar fc Silk 20 06 

Hopkins et al. 2008 [ ). So, to answer the question of 



where, today, are the super-dense objects seen at high-z, 
it has been suggested that they must have undergone 
dissipationless mergers since then, so as to have gone 
unnoticed today. (Cimatti et al. 2008 note that there is 
a class of local objects which may be direct descendents 
of the high-z samples for which the merger hypothesis is 
unnecessary - these are the fast rotators in the sample of 
Bernardi et al. 2008 which have large a but small sizes. But 
these objects are too rare to be the typical descendents. 
Recently, Trujillo et al. 2009 also found a very low number 
density of old superdense massive galaxies in the present 
Universe) . 

However, Fan et al. (2008) have shown a model which 
may be able to reproduce the observed evolution in the 
size— Af, relation. They assume that the AGN feedback 
which expels gas from the central regions produces a sudden 
reduction of mass in the core, as a result of which the stellar 
distribution puffs-up. In this model, the sizes increase, but 
the stellar masses do not. Since the peak of AGN activity 
was around z ^ 2, most of the size evolution in this model 
is over by z ~ 1. In contrast, dry mergers and evolution in 



hierarchical models is expected to continue to the present 
day (|De Lucia et al. 20061 1 Almeida et al. 2007|) . 

Most observational studies have concentrated on 
objects at z > 1. van der Wei et al. (2008) measured 
the size of a sample of massive early-type galaxies at 
z ^ 1 and found that they are about a factor of two 
smaller than at z ~ 0. One of the main goals of this 
paper is to investigate the possibility that the sizes 
are evolving even at small redshift {z < 0.3), pay- 
ing particular attention to those objects for which dry 
dissipationless merging is thought to have been most 
common - early-type BCGs l|Malumuth fc Kirshner 19811 
Malumuth fc Kirshner 1985] [Oegerle fc Ho csscl 199l| 

iLaucr ot al. 20071 IBernardi et al. 20071 ITran et al. 2008^ 
Late-type BCGs, and, indeed, late-type galaxies, are not 
studied in this paper, so we will often not bother to specify 
the qualifier 'early-type' when we discuss BCGs (and 
similarly, when we discuss non-central galaxies). 

Early-type BCGs at 0.4 < z < 1 have been compared 
with more local samples recently. One study concludes that 
the stellar mass appears to not have grown significantly 
since z ^ 1 ( |Whiley et al. 2008 1 , but does not include a 
study of the BCG sizes. Another suggests that the sizes 
may have been smaller at high redshift, but some of this 
apparent evolution was almost certainly a consequence of 
not looking at fixed restframe wavelength or stellar mass 
([Nelson et al. 20021 . 

Section[2]describes our early-type BCG sample in which 
we have size, velocity dispersion, stellar mass and age esti- 
mates out to z ~ 0.3. In Section [3] we show that, at fixed 
(evolution corrected) luminosity BCGs were larger in the 
past, and they had smaller velocity dispersions. This remains 
true if we replace luminosity with stellar mass, although in 
this case the measurement is complicated by selection effects 
(as we illustrate in Appendix A, where we also discuss how 
correlated errors in stellar mass and age can compromise the 
observed correlations) . We also show that the objects which 
formed earlier are smaller, but their velocity dispersions are 
not larger - whereas the former is expected in monolithic 
collapse models the latter is harder to arrange. 

In Section [4] we compare the sizes and ages of BCGs 
to those of other (early-type) cluster galaxies. This com- 
plements recent work indicating that non-central or satel- 
lite luminosities should be approximately independent of 
the mass of a group or the luminosity of the central BCG 
([Skibba et al. 20061 [Skibba et al. 2"007|) . 

A final section summarizes our results. Appendix B dis- 
cusses how our luminosity-size correlation for BCGs com- 
pares with other recent determinations. 

Complementary studies of the sizes and velocity dis- 
persion of the bulk of the local early type population are 
presented in Shankar fc Bernardi (2009) and Shankar et al. 
(2009). These other studies show that the size- luminosity 
relation of galaxies with Lr < 10^^ Lq at z ~ is indepen- 
dent of the age of the stellar population, implying that the 
size-stellar mass relation does depend on age: older galaxies 
are smaller than younger galaxies of the same stellar mass. 
However, they find a weak dependence of the velocity dis- 
persions (at fixed mass) on galaxy age - this is difficult to 
accomodate in a monolithic-based collapse model. The am- 
plitude of these trends for massive galaxies (Lr > lO'^^ Lq) 
is even smaller. 
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As discussed below, we show that early-type 
BCGs/massive galaxies are different, and have evolved 
differently, at least at small lookback times, from the bulk 
of early-type galaxies; their properties are more consistent 
with formation from predominantly dissipationless mergers 
than from a monolithic collapse. 



2 THE SAMPLE 

In what follows, we will study the luminosities, sizes, veloc- 
ity dispersions and stellar masses of early-type BCGs iden- 
tified in the SDSS. Spectra are available for objects with 
rrir < 17.7, whereas the photometry allows galaxies to be 
reliably identified down to about 4 magnitudes deeper. The 
photometric quantities (magnitudes, half-light radii) we use 
in what follows are not exactly the same as those output by 
the SDSS DR6 database. Rather, we use the prescriptions 
(Equations 1 - 4) in Hyde & Bernardi (2009) to correct the 
SDSS parameters for known sky subtraction problems with 
the photometry of bright objects in crowded fields. Typi- 
cally, the corrected magnitudes are slightly brighter and the 
sizes correspondingly larger (there is a well-known correla- 
tion for correlated errors in early-type galaxy photometry 
which has re//e * ^constant). The velocity dispersions are 
measured through a fiber of radius 1.5 arcsec; they are then 
corrected to r^/S (as is standard practice). The size and ve- 
locity dispersion can be combined to estimate a dynamical 
mass; we do this by setting Mdyn ~ ^R^a^ /G. For a subset 
of objects, stellar masses, and luminosity-weighted age esti- 
mates are available from Gallazzi et al. (2005). Throughout, 
angular diameter and luminosity distances were computed 
from the measured redshifts assuming a Hubble constant of 
Hq = 70 km s~^ Mpc~^ in a geometrically flat background 
model dominated by a cosmological constant at the present 
time: (fio, Ao) = (0.3,0.7). 

We would like to study early-type BCGs over a rela- 
tively large range in redshift. Since we are most interested 
in evolution, we would like the sample to be homogeneously 
selected over the entire redshift range. This motivates the 
use of the MaxBCG catalog which was assembled by Koester et 
al. (2007) from the SDSS. It spans the range 0.07 < z < 0.3; 
the 13823 groups and clusters in it each contain at least 
10 galaxies brighter than 0.41/, in the r-band (i.e., brighter 
than about —19.5 mags in r). 

At the very least, we would like to study BCG lumi- 
nosities and sizes over this redshift range, but, if possible, we 
would like to study velocity dispersions and stellar masses as 
well. Of 13823 clusters in the MaxBCG catalog, 5413 are re- 
ported to have BCGs with spectroscopic redshifts. However, 
upon matching with the SDSS-DR6 (Adelman-McCarthy et 
al. 2008), we find 7832 objects with spectroscopic redshifts. 
A subset of 4912 has deVaucouleur apparent magnitude be- 
tween 14.5 and 17.5. Requiring fracDev>0.8 in both the 
g- and r-bands is a good way to select early-types; this re- 
duces the sample size to 4350. Of these, 2634 have stellar 
mass estimates from Gallazzi et al. (2005), and only 2012 
of these have estimated velocity dispersions. For the objects 
with stellar mass estimates, Gallazzi et al. (2005) also pro- 
vide (luminosity weighted) age estimates. The results which 
follow that are based on L rather than M, do not depend 
on whether or not we used 4350 or 2634 galaxies. (Requir- 
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Figure 1. Redshift distribution of BCGs from the C4 (thin solid) 
and MaxBCG (thick solid) catalogs. Dot-dashed histogram shows 
the redshift distribution of the objects we identify as satellites, 
and dashed histogram shows the redshift distribution of the sub- 
set of BCGs which host these satellites. 
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Figure 2. Luminosity distribution of early-type MaxBCGs with 
early- type satellites (solid), and satellites (dashed) over the range 
O.Kz < 0.15. 

ing that velocity dispersions were estimated but not caring 
about M* makes the sample size 3277.) 

We will also be interested in how BCGs compare to 
non-central or satellite galaxies. We do this by search- 
ing an appropriately chosen volume around each MaxBCG 
in our sample as follows. The catalog contains an esti- 
mate of the number of galaxies, A^gai in the group as- 
sociated with each BCG. We define a velocity dispersion 
(Jgroup = 100 -y/LSSlVgli' km s~^ and a radius r group = 
group /T I'JQ Mpc. The relation between group radius and 
velocity dispersion is chosen to approximately match the 
expected scaling for dark matter halos. The resulting scal- 
ing between group velocity dispersion and Ngai is close to 
that reported by Becker et al. (2007) in their analysis of this 
catalog. We identify as satellites all objects which are within 
r group across and 2agroup along the line of sight to a BCG in 
the catalog. Having identified the satellites, we would now 
like to exclude those that are of later type. We do so by 
applying the same selection cuts as we did to identify early- 



4 M. Bernardi 



type BCGs. This leaves a sample of 1734 objects - less than 
half the full sample of satellites. (If the BCG does not sit 
at the center of its cluster, then we will be making errors of 
inclusion and exclusion at the cluster boundaries; but since 
we are already making approximations about which objects 
along the line of sight are true members, which lead to qual- 
itatively similar errors, we do not attempt to correct for this 
effect.) 

Note that our satellite sample is smaller than the BCG 
sample; this might seem in conflict with our previous state- 
ment that each maxBCG cluster has at least 10 satellites 
brighter than 0.4Lt. This is because we are requiring that 
satellites have measured spectra; because of the SDSS mag- 
nitude limit, spectra of 0.4L* objects are not available be- 
yond z ~ 0.05. For example, of the 1734 satellites in our sam- 
ple, 1555 have Mr < —22. Thus, the vast majority of these 
objects are more luminous than 3Lt - they represent the 
bright-end of the satellite luminosity function. In addition, 
if an SDSS fiber was placed on a BCG, then objects within 
55 arcsecs of it will not have an SDSS spectrum. These fiber- 
collisions affect a larger physical scale for the higher redshift 
clusters, further reducing the number of satellites with mea- 
sured spectra. Note that this means we tend to miss those 
satellites which are closest to their BCGs - dynamical fric- 
tion arguments suggest that these may be amongst the most 
massive satellites in each cluster. 

Johnston et al. (2007) note that about 20% of the BCGs 
are misidentified, and that this fraction increases at lower 
Ngai- In such cases where a satellite has been classified as 
a BCG, fiber collisions almost certainly prevent the BCG 
from having a spectrum and so entering our sample of satel- 
lites. This will serve to make the BCG sample more like the 
satellites, but not vice versa - a point we return to later. 

At lower redshifts, we use the C4 catalog of Miller et 
al. (2005). Although this is a rather different catalog, we 
show below that if we extrapolate the trends we see in the 
MaxBCG catalog to smaller z, then they are in good agree- 
ment with those in the C4 catalog. (The most important 
difference is that the MaxBCG algorithm searches for a red 
sequence in the photometric sample, whereas the C4 method 
simply searches for objects with similar colors in the spec- 
troscopic sample, so it does not select against groups of blue 
galaxies. Since we only use the early-type BCGs from this 
catalog anyway, the real difference is that, because the C4 
method is based on the spectroscopic sample, it may miss 
the true BCG because of fiber collisions. This is a known 
problem, for which the catalog provides a flag. We only use 
the subset of C4 clusters for which the photometric and 
spectroscopic BCG are the same.) 

Furthermore, there is previous work on the C4 BCG 
catalog; studying this catalog allows us to tie our measure- 
ments to those in the literature (see Appendix B). This is 
important because, although all the objects we analyze are 
drawn from the SDSS database, the photometric quantities 
(magnitudes, half-light radii) are not exactly the same as 
those output by the survey. Rather, we use the prescriptions 
in Hyde & Bernardi (2009) to correct the SDSS parameters 
for known sky subtraction problems with the photometry of 
bright objects in crowded fields. 

Figure [T] shows the distribution of early- type BCG red- 
shifts in the C4 (solid) and MaxBCG (MaxBCG-DR6spec, 
solid) catalogs (the C4 BCGs probe lower redshifts), the 
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Figure 3. Size-luminosity relation for the bulk of the population 
(dashed) and BCGs (dot-dashed). Filled symbols show this rela- 
tion when the sample is restricted to a narrow range in a: results 
for 6 bins of width 0.1 dex, starting from logjQ a = 1.9, are shown. 
Open circles show a similar analysis of the BCGs: 4 bins of width 
0.05 dex, starting from logjQ a = 2.3. 

redshift distribution of MaxBCG early-type satellites (dot- 
ted), and the redshift distribution of early- type MaxBCGs 
with early- type satellites (MaxBCG- withSatel, dashed). The 
satellite distribution does not extend to as high redshifts, be- 
cause satellites are fainter than BCGs (by definition). The 
dashed histogram shows the redshift distribution of those 
BCGs which have satellites with spectroscopic information. 
Most of the high redshift BCGs have no satellites, due to the 
combined effects of the magnitude limit and fiber-collisions. 
However, over the redshift range 0.1 < z < 0.15 there are 
approximately three satellites per BCG; we will only use 
the objects in this range when we compare BCG and satel- 
lite properties. Figure [2] shows the distribution of (evolution 
corrected) luminosities over this range: the mean BCG lu- 
minosity is about half a magnitude brighter. 



3 THE SIZE-LUMINOSITY AND 

SIZE-STELLAR MASS RELATIONS 

The main goal of this section is to present measurements of 
the correlation between (restframe) size and (evolution cor- 
rected) luminosity in a few redshift bins. (The measured 
luminosities are corrected for known problems associated 
with the SDSS sky subtraction algorithm following Equa- 
tion 2 in Hyde & Bernardi 2009. They are then corrected 
for evolution, by adding 0.92: to all absolute magnitudes. 
The sizes are corrected for the known sky subtraction prob- 
lems and for the fact that early-type sizes depend on wave- 
length following Equations 3 and 6 in Hyde & Bernardi 2009, 
respectively.) Inferences about evolution in this correlation 
depend upon how much one believes that the effects of lu- 
minosity evolution have been removed. For this reason, one 
might have thought it preferable to study the correlation 
between size and stellar mass. In what follows, we will show 
these relations side-by-side, but emphasize that because the 
SDSS is magnitude limited, evolution in correlations with 
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Figure 4. Size-luminosity and stellar mass relations for BCGs in the C4 (stars) and MaxBCG (open squares) samples (top), and 
similarly when size is replaced by velocity dispersion (bottom). Filled circles show the median value and its uncertainty for the bulk of 
the early-type population; thin and thick black solid lines show the linear and quadratic fits from Table 1 of Hyde &; Bernardi (2009), 
respectively. Dashed and dotted curves show the regions which enclose 68% and 95% of the objects. At fixed L or M*, the C4 BCGs are 
larger than MaxBCGs, and both are larger than the mean relation traced by the bulk of the population. The objects with largest sizes 
have the smallest velocity dispersions. 



M* should be interpretted carefully. Appendix A discusses 
why. Appendix B compares the BCG size-luminosity rela- 
tion we find here with other determinations in the recent 
literature. 

Note that although our M« estimates are from Gallazzi 
et al. (2005), we correct them slightly to account for the 
fact that they actually come from multiplying an estimate 
of M* /L by the observed luminosity. The L used by Gallazzi 
et al. did not account for the SDSS sky subtraction problem, 
so we divide their M« estimates by the same correction factor 
we used for the magnitudes (Equation 4 in Hyde & Bernardi 
2009). 

3.1 Abnormally large sizes 

The sizes and luminosities of early-type galaxies are corre- 
lated: the mean size increases with luminosity as _R oc L^'^ 
(e.g. Hyde & Bernardi 2009). However, BCGs follow a 
steeper relation: R cc L (see Appendix B), and it has long 
been argued that this is evidence for formation histories 
that are dominated by dry mergers. The argument is not so 



straightforward, however. This is because objects with small 
values of a are not BCGs, so BCGs have a narrower distri- 
bution in cr than the bulk of the population. However, for 
the bulk of the population, the R — L correlation at fixed cr 
is considerably steeper, R oc L"'^, than when averaged over 
all a (Bernardi et al. 2003; Bernardi et al. 2008). So one may 
ask if the steeper relation for BCGs can be attributed to the 
fact that they are biased to larger a. 



Figure [3] presents a direct comparison: the dashed and 
dot-dashed lines in the top panel show the Re — L relation 
for the bulk and for the early-type BCGs, and the various 
solid lines show this relation for narrow bins in cr: the lines 
are offset to smaller R^ as cr increases. The bottom panel 
shows these relations after subtracting-off the dashed line. 
This shows clearly that, except for the smallest bin in cr, 
the other relations are all steeper. However, the BCGs are 
steeper still: the Re — L relation of BCGs is steeper than 
that of the bulk, even at fixed cr. In the next subsections, we 
study other evidence that BCGs are a different population. 
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Figure 5. Same as previous figure, but now for objects from the MaxBCG sample only, subdivided into redshift bins as indicated. At 
fixed luminosity or stellar mass, the BCGs in the lowest redshift bin are larger and have smaller velocity dispersions. 



3.2 Evidence for evolution 

The top panels in Figure |4] compare the size-luminosity 
and stellar mass relations for the BCGs in the C4 and 
MaxBCG catalogs, with the relations traced out by the 
bulk of the early-type galaxy population. At fixed L, the 
C4 BCGs are larger than MaxBCGs, and both are larger 
than the mean relation traced by the bulk of the population. 
These differences are most pronounced for the most lumi- 
nous objects; there is essentially no effect at Air > —22.5 or 
logio(M./Mo) < 11.2. 

The bottom panels show that complementary differ- 
ences are seen when size is replaced with velocity dispersion: 
at the brightest luminosities (Mr < —23) C4 BCGs have the 
smallest velocity dispersions. While the trends at the bright 
end are the ones of most interest in the present context, we 
note that, at fainter luminosities, BCGs tend to have larger 
a for their L than the bulk of the population. 

That BCGs have larger sizes and smaller velocity dis- 
persions than the bulk is no surprise - what ts surprising is 
the significant difference between the two BCG samples. Al- 
though it is possible that this is related to the fact that the 
two samples span different redshift ranges, it is also possible 
that systematic differences between how the catalogs were 
assembled are to blame. 



To eliminate the second possibility. Figure \S\ shows a 
similar analysis, but now restricted to MaxBCG objects 
only. Since this sample is relatively large, we divided it into 
subsamples in redshift: 0.07 < z < 0.12, 0.17 < z < 0.22 
and 0.25 < z < 0.30. This shows clearly that, even within 
the MaxBCG catalog itself, the lower redshift BCGs tend to 
have larger sizes and smaller velocity dispersions than their 
higher redshift counterparts of similar luminosity or stellar 
mass. Moreover, the MaxBCGs in the lowest redshift bin 
tend to follow similar scaling relations to those defined by 
the C4 BCGs. 

Finally, Figure |6] shows the i?e — L relation for the 
bulk of the early-type galaxy population. (Recall that the 
measured luminosities have been corrected for evolution 
by adding 0.9^ to all absolute magnitudes, and the sizes 
are corrected for the fact that early-type sizes depend on 
wavelength.) Each set of symbols shows data from a num- 
ber of redshift bins: 0.07 < z < 0.1, 0.1 < z < 0.13, 
0.13 < z < 0.16, 0.16 < z < 0.19, 0.19 < z < 0.22, 
0.22 < z < 0.25 and z > 0.25. To reduce the range of 
sizes, in the top panel we have subtracted out a fiducial rela- 
tion to better see if there is any evolution: we actually show 
Alogio^^e = logio(-Re/kpc) - (4.72 + 0.63 Mr + 0.02 M^) 
(from Table 1 of Hyde & Bernardi 2009). The top panel in 
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Figure 6. Residuals from the size-luminosity relation 
AlogioiJe = logio(-Re/kpc) - (4.72 -I- 0.63 Mr + 0.02M2), 
for the bulk of the early-type galaxy population in different 
redshift bins: 0.07 < z < 0.1, Q.l < z < 0.13, 0.13 < z < 0.16, 
0.16 < z < 0.19, 0.19 < z < 0.22, 0.22 < z < 0.25 and 
z > 0.25. At Mr > —22, there is a tendency for the objects at 
higher redshift to have slightly smaller cr. At Mr < —22, the 
difference in size between two different redshift bins increases 
for brighter galaxies, in agreement with the evolution seen for 
BCGs in Figure [5] Bottom panel shows the result of applying 
the corrections in Equations 1 and 3. 



Figure [7] shows a similar analysis of the velocity dispersions, 
for which Alogigcr = logio(cr/kms"^) - (-2.97 - 0.37 - 
0.006 Mr^) (from Table 1 of Hyde & Bernard! 2009). 

There is a hint that the higher redshift objects have 
smaller sizes. At Mr < —22, the difference in size between 
two different redshift bins increases for brighter galaxies, 
in agreement with the evolution seen for BCGs (Figure [S}. 
Thus, at the bright end {Mr < —22), we find that the evo- 
lution depends on the luminosity of the galaxy: the sizes 
evolve as (1 -I- a,nd the velocity dispersions as 

(1 + 2)-0-2(J^=f^+2i). At fainter luminosities {Mr > -22), the 
evolution is weaker; we approximate it as (1 -I- z)""'^ and 
(1 -|- z)°'^ . Hence, to correct the sizes and velocity disper- 
sions to 2: = one could use: 



logio 



ocorr 



kpc 



= log 



Re 



logi 



V kms 



= log 



> kpc 



10 



Vkms 



0.7(Mr + 21) log(l + zll) 
+ 0.2{Mr + 21) log(l +(2) 



Figure 7. Same as the previous figure, but now for the cr- 
luminosity relation: AlogjQcr = log]^g(o-/kms^^) — (—2.97 — 
0.37 Mr - 0.006 M^), for the bulk of the early- type galaxy popu- 
lation in different redshift bins. Bottom panel shows the result of 
applying the corrections in Equations 2 and 4. 



if Mr < -22 and by 



logio 



kpc 



= log 



log 



kms 



Re 



+ 0.71og(l + 2) 



(3) 



= logio -0.2 log(l + z) (4) 



if Mr > —22. The bottom panels in the two figures show the 
result of applying these corrections. We show below that the 
scaling at the bright end is slightly smaller than the lumi- 
nosity dependent evolution in the sizes of our BCG sample 

which goes as (1 + 2)0-85(M.+21)^ 



3.3 Dependence on age and formation time 

If we are seeing evolution, then it is interesting to ask if this 
depends on the age or formation time of the stellar popu- 
lation. E.g., the simplest monolithic collapse models predict 
that, for fixed formation time, there should otherwise be no 
dependence on age. To address this, we use age estimates of 
the stellar populations in these galaxies (from Gallazzi et al. 
2005); these, with the observed redshifts, yield the lookback 
time to when the stars formed. (Because the age and M, 
estimates have significant uncertainties, and they are corre- 
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Figure 8. Same as Figure [5] but now the objects in each redshift bin are subdivided by the lookback time to when their stars formed. 
At fixed formation time, objects at low redshift have larger sizes and smaller velocity dispersions than their higher redshift counterparts 
of similar M» (left) or L (right). 



lated, it is important to use ages that are output from the 
same models which estimate M*.) 

Figure |H] is similar to Figure [S] but now the objects 
in each redshift bin have been divided into two subgroups, 
based on the formation time of the stars. These two bins 
correspond approximately to Zform ~ 2.5 (solid lines) and 
1.25 (dashed lines). 

We begin with a comparison of the solid lines in the pan- 
els on the left. These show that, for fixed formation time, 
older objects have larger sizes (top left) and smaller velocity 
dispersions (bottom left) than younger objects of the same 
stellar mass. The same is true of the dashed lines in these 
panels: size increases and velocity dispersion decreases as 
the galaxy population ages. The increase in size is more eas- 
ily accomodated in dry merger models, and the decrease in 
velocity dispersion suggests that these mergers were minor. 

Comparison of the dashed and solid curves for a given 
redshift of observation shows that, for a given stellar mass, 
the objects which formed more recently have larger sizes. 
While this is qualitatively consistent with having formed 
when the Universe was less dense, the difference is much 
less than the factor of log]^o(3-5/2.25) = 0.2 dex one might 
naively have expected. Similarly, the velocity dispersions of 
the objects which formed more recently are not much smaller 



than when the formation redshift was higher. Presumably, 
this is because the sizes of the older objects have increased 
from their initial values, and the velocity dispersions have 
decreased (as suggested by comparing the solid lines with 
one another, and the dashed lines with one another). 

There are important qualitative differences when one 
uses luminosity rather than Af, , meaning that care must 
be taken when translating trends seen in plots with L into 
trends with M*. The top right panel shows that, at a given 
redshift of observation, the size-luminosity correlation does 
not depend on formation time (solid and dashed lines over- 
lap), and the a — L relation does not either (bottom right 
panel). This may be understood as follows. The panel on 
the right is obtained by shifting each galaxy in the panel on 
the left by its {Mt,/L)~^. In a model where the stars age 
passively (whatever the assembly history), the older popu- 
lation has a larger M^/L: the expected difference in M^/L 
between the two age bins is about 0.1 dex {M^/L oc t"'^^ or 
so, where t is the age of the population). So, if we start from 
the Re — M, relation, then the solid curves in each redshift 
bin should shift towards the left, bringing them closer to the 
dashed ones. 

Finally, Figure [9] shows the effect of correcting the sizes 
and velocity dispersions of our early-type BCGs using an 
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Figure 9. Same as right-hand panels of previous figure, but now 
after correcting the sizes and velocity dispersions for evolution 
using Equations 1 and 2. 



equation similar to Equation ([T]) (since the size evolution 
of BCGs is better described by (1 + 2)0 85(a^^+2i) re- 
place 0.7 with 0.85) and Equation ((2)1. These corrections 
bring the curves associated with diflterent redshifts into bet- 
ter agreement, suggesting that they have captured most of 
the evolution. 

We conclude that, at fixed luminosity or stellar mass, 
the high redshift BCGs are denser, with the effect being 
more pronounced for objects with the largest luminosities 
or stellar masses. The most straightforward interpretation 
of this observation is that the sizes and velocity dispersions 
of luminous BCGs are evolving in a manner which is qualita- 
tively consistent with assembly histories that are dominated 
by dissipationless mergers which are still happening at low 
redshift; the large size of our sample has allowed a detection 
of this even though it spans only a small lookback time. 

Before we conclude this section, we emphasize again 
that one must be cautious when replacing luminosity with 
stellar mass, since selection effects can complicate the mea- 
surement (as we illustrate in Appendix A). Working with 
age and M, presents additional complications because er- 
rors in age and M^/L are correlated (see Appendix IA2I for 
further discussion). 



4 CENTRALS AND SATELLITES 

Early-type BCGs tend to have larger sizes than the bulk of 
the early-type galaxy population. So one might wonder if the 
large sizes of BCGs are something that is characteristic of 
the group/cluster environment, or if this is specific to BCGs. 

To address this. Figure [10] compares the scaling rela- 
tions of the objects we identified as early-type satellites, 
with those for early-type BCGs and for the bulk of the 
early-type population. This comparison indicates that the 
non-central/satellite early-types tend to be very similar to 
the bulk of the early-type population; it is the BCGs which 
are different. At \ogii^{M, /Mq) > 11.4 they have unusually 
large sizes with the effect increasing at large M«; at lower 
masses (logio(Af,/M©) < 11.4), there is a hint that BCG 
velocity dispersions are larger than those of satellites and 
of the bulk of the early-type population (a hint also seen in 
Figure O • 

Our findings appear to contradict those of Weinmann 
et al. (2008) who report that the sizes of early-type central 
galaxies are not larger than the sizes of early-type satellites 
of the same stellar mass (see their Fig. 4). We suspected 
that some discrepancy arised because they used Petrosian- 
based quantities which are ill-suited for this sort of analysis 
( |Hyde fc Bernardi 2009[ ). However, more recently Cuo et al. 
(2009) have fit Sersic profiles to a subset of the same sample 
of galaxies and found a similar result: no difference in the 
size of central and satellite early-type galaxies of the same 
stellar mass. The difference between our results and theirs 
may be explained by the fact that they studied groups which 
are less massive than ours and so they do not have satellites 
at the high mass end where we find the differences to be 
most significant. 

The ratio of dynamical to stellar mass is another quan- 
tity of recent interest ( [Hyde fc Bernardi 2009[ ); it increases 
at large masses, suggesting that star formation is ineffi- 
cient at large mass. Figure [TT] shows that this ratio is about 
0.05 dex larger for BCGs than it is for the bulk of the pop- 
ulation of the same luminosity, whereas the satellites are 
similar to the bulk of the population. At the bright end 
(A/r < —22.8), the difference is primarly due to the differ- 
ences in sizes - the velocity dispersions of BCGs are simi- 
lar to those of satellites of the same luminosity. In a model 
where BCGs formed from dissipationless mergers this is eas- 
ily understood: the offset to large Mdyn/M* is associated not 
with lower star formation efficiency, but with the subsequent 
assembly of the stars which has increased the sizes more than 
the velocity dispersions. At the faint end {Mr > —22.5), the 
difference is probably due to the velocity dispersions rather 
than to the stellar masses. 

We have also studied the correlation between the ages 
and luminosities or stellar masses of BCGs and satellites. 
(Appendix A discusses why, at smaller A/* than we show 
here, the trends with stellar mass may be strongly affected 
by selection effects - see Figure IXSl ) Figure [T2l shows that at 
luminosities of about L, (i.e., ~ —21.2 mag in the r— band) 
and larger, the bulk of the population defines an age— Af* 
or age— L relation: massive or more luminous galaxies tend 
to be slightly older. There is a hint that, about a magni- 
tude brighter than L*, BCGs are slightly older (~ 0.5 Gyr) 
than other objects of the same luminosity; but there is no 
difference at brighter or fainter L, and there is no difference 
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Figure 10. Comparison of sizes and velocity dispersions of BCGs and non-central/satellite cluster galaxies of similar luminosity and 
stellar mass, in a narrow redshift bin. 



when compared with objects of the same M« . Satellites and 
BCGs (whether or not they have satellites) follow the same 
age— M* relation as the bulk of the population. This is re- 
markable, given that we see this trend over a redshift range 
where the counts of centrals and satellites are approximately 
proportional to one another (Figure[T]). Given that the BCGs 
in the same volume are more luminous (Figure [2}, and have 
larger stellar masses on average, one might have expected 
the BCGs to also be older. Our results indicate that, if they 
have the same stellar mass, satellites and BCGs have the 
same age. 



To understand why this happens, we now compare the 
luminosity-weighted age difference between satellites and 
their BCG, as a function of BCG luminosity. Figure 1131 
shows that BCGs tend to be older than their satellites, by 
about 0.5 — 1 Gyrs, over about two magnitudes in BCG lu- 
minosity - a range over which the mean BCG age changes 
by about 1 Gyr. (Our results are unchanged if we use the 
number- rather than luminosity- weighted age difference.) 
The younger ages with the smaller luminosities conspire to 
keep the satellites and BCGs along the same age-L or age- 
M* relations. 
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Figure 11. The dynamical to stellar mass ratio versus absolute 
magnitude of BCGs and non-central/satellite cluster galaxies of 
similar luminosity and stellar mass, in a narrow redshift bin. 



5 DISCUSSION 

Early-type BCGs have larger sizes than other early-type 
galaxies of similar luminosity or stellar mass. If restricted to 
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Figure 12. Correlation between lookback time to formation and stellar mass (left) or luminosity (right) for BCGs (squares) and satellites 
(triangles), over the redshift range where the number counts differ by a factor of about 3. Filled circles show the locus traced by the bulk 
of the early-type population; dashed and dotted lines show the region containing 68% and 95% of the objects. 



a narrow bin in velocity dispersion, the size-L relation of the 
bulk of the population is steeper, suggesting that perhaps 
it is the fact that BCGs are biased towards larger veloc- 
ity dispersions that is the origin of this difference. However, 
at fixed a, the BCG Re ^ L scaling relation is steeper still 
(Figure [S]). 

Moreover, the sizes of BCGs appear to be evolving: 
higher redshift BCGs had smaller sizes than their local coun- 
terparts of the same (evolution corrected) luminosity or stel- 
lar mass (Figure [5]). The evolution in the Re — L relation of 
the early-type BCG population (and in general of galaxies 
with ~ Mr < —22) is more evident than that of the bulk 
of the early-type population at fainter luminosity: the BCG 
sizes evolve as (1 -I- 2;)0-85(m,+2i) (pigm-es [gjl . The evolution 
in the a — L relation over this same period suggests that 
velocity dispersions evolve as (1 + 2;)~''-2(*^'-+2i) . 

For the bulk of the early-type population, the sizes of 
bright objects {Mr < -22) evolve as (1 + z)°-''^^^+^^^ and 
the velocity dispersions evolve similarly to BCGs (Figures [6] 
and [7|. The scaling for the bulk of the early- type popu- 
lation that we see from studying small lookback times is 
consistent with that reported by van der Wei et al. (2008) 
from a comparison of 2; = with z — 1 objects: they find 



that the sizes evolve as {1 + z) 



for objects brighter 



than Mr —22 (they did not study luminosity or mass- 
dependent trends). 

At fainter luminosities {Mr > —22), the evolution is 
weaker; it goes as (1 -I- z)""'^ and (1 -|- z)^'^. See Shankar & 
Bernardi (2009) for a more detailed analysis of the bulk of 
the population, who conclude that the more massive galaxies 
{Lr > 10^^ Lq) show stronger evidence of the effects of 
dissipationless mergers. 

The cleanest tests of the evolution we see come from re- 
stricting the BCG sample to narrow bins in formation time. 
At fixed formation time and stellar mass, the objects ob- 
served at lower redshift are larger, and their velocity dis- 
persions are smaller (compare solid curves in Figure [8]). The 
evolution, which we detect over lookback times as small as 
1 Gyr, is difficult to reconcile with the simplest monolithic 
collapse models, and is most pronounced for the most lumi- 
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Figure 13. Difference between the mean luminosity-weighted age 
of the satellites in a group, and that of the BCG, as a function of 
BCG luminosity. 



nous objects. The recent growth in BCG sizes is in qualita- 
tive agreement with hierarchical galaxy formation models in 
which the assembly of BCGs continues to the present day 
iPe Lucia et al. 20061 1 Almeida et al. 200711 . 

Figure |8] also shows that the objects which formed ear- 
lier are smaller, but their velocity dispersions are not larger 
(compare dashed with solid curves in Figure |SJ . Whereas 
the former is expected in the simplest monolithic collapse 
models - the universe was denser at high redshift, so one 
expects the younger objects of a given mass to be smaller 
and have larger velocity dispersions - the latter is harder to 
arrange. Thus, both trends in Figure [8] - the evolution of 
the sizes and velocity dispersions of objects (of fixed stellar 
mass) that formed at the same time, and the dependence of 
the sizes and velocity dispersions on formation time - are 
difficult to accomodate in the simplest monolithic collapse 
models. 

Recently, Fan et al. (2008) have suggested a pujfing- 
up scenario for the evolution in the size— M* relation — it 
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postulates that it is the sizes which evolve, not the stel- 
lar masses. This model exploits the fact that the super- 
dense galaxies mentioned in the Introduction are observed 
at about the epoch at which QSOs are most active; feedback 
from the AGN activity at z ~ 2 or 3 is assumed to expel 
gas from the central regions. The sudden reduction of mass 
in the core makes the surrounding stellar distribution puff 
up, after which the objects settle down to new (larger) sizes. 
This is expected to have been completed by z ^ 1, whereas our 
observations of evolving sizes are at low redshift, so it seems 
unlikely that this mechanism can explain our measurements. 
Also, there is little evidence for recently outflowing gas in 
our BCG sample. Nevertheless, we note that if the stellar 
masses have not changed, then Figure |8] suggests that at 
logj^Q(M*/M0) > 11.5, the sizes have increased by a factor 
of 1.5, and the velocity dispersions have decreased by a fac- 
tor of 1.15, between z — 0.27 and z — 0.09. In addition, the 
dependence of the size and (especially) velocity dispersion of 
BCGs on formation time (at a given Af,) predicted by Fan 
et al. (AlogiQ _Re > 0.2 and Alogj^Q a > 0.1) is significantly 
larger than what we observe in Figure |5] (compare dashed 
with solid curves for a given redshift). 

These results (i.e. the evolution in size and veloc- 
ity dispersion at small lookback times, and the weak 
dependence on formation time of the a — L relation) 
are more consistent with models which assume that 
galaxies formed from predominantly dissipationless merg- 
ers (|Malumuth fc Kirshner 19851 [Robertson et al. 20061 
[Boylan-Kolchin et al. 2006] [Hopkins et al. 2008[ ). In such 
models, the stars formed in gas rich mergers at high redshift 
(explaining the small sizes at early times), but were assem- 
bled into BCGs at later times by gas-poor, dissipationless 
mergers. In these models, the stellar masses, the sizes and 
the velocity dispersions can evolve, and the question arises 
as to whether the mergers were major (approximately equal 
size pieces) or minor. Our results suggest that the recent 
mergers for BCGs were minor. This is because equal mass 
mergers have the growth in stellar mass approximately the 
same as the growth in size, with little change in velocity 
dispersion iCiotti 2008). However, our Figure [8] shows that 
the velocity dispersion decreases. 

If the mergers are minor, and the mass increases by a 
factor of (1 + /) in each merger, where / ^ 1, then the 
size increases by (1 -I- 2/) and decreases by (1 — /) in 
each merger. In this case, a given change in size implies a 
smaller change in mass than if the changes were caused by 
a major merger. In this case, we can estimate a required 
change in mass and size by sliding the 7?e — M« relation at 
z — 0.27 upwards and to the right until it sits above the 
z ~ 0.1 relation: this suggests that / ~ 0.4. The predicted 
evolution in the a — M, relation can now be compared with 
that observed; while it is in the right sense, it is a little too 
strong. The problem can be alleviated slightly if we account 
for the fact that the stellar mass estimates at low redshift 
are slightly smaller than they should be, because of mass 
losses associated with stellar evolution. If so, the implied 
growth in mass agrees well with that expected in the hier- 
achical models (|De Lucia et al. 20061 1 Almeida et al. 2007p . 
However, Almeida et al. (2007) predict that the velocity dis- 
persions of luminous red galaxies (most luminous early-type 
BCGs are LRGs) were smaller, not larger, in the past. 

Note that a 0.4:1 merger is not what we would call mi- 



nor; we are supposing that the mass increase of 40% was 
due to a sequence of minor mergers (e.g. four mergers each 
adding ~10% to the mass and increasing the size by ~20%). 
In this context, it is also interesting that motion along an 
Re (X M» line (major mergers) cannot bring the superdense 
galaxies recently seen at 2 ~ 2 onto the z ~ 7?e — Mt rela- 
tion. However, minor mergers (motion along a line of slope 
2 in the log(_R) — log(M«) plane) may bring them onto the 
local relation traced by BCGs. The objects at 2 ~ 2 have 
large A/* even by z ~ standards, so it is not implausible 
that they are the progenitors of today's BCGs. This requires 
mass growth factors of order 4 or 5 (0.6 dex), coupled with 
an increase in size by a factor of order 10. This is consis- 
tent with our estimate of the observed BCG size evolution 
(1 + 2)0 S5(A^r+2i). getting Mr ~ -23.5 and z = 2 we get an 
evolution of a factor of ~ 10. 

Recent results support our conclusion that minor merg- 
ers are important. Hopkins et al. (2008) suggest that if merg- 
ers are major, then the fraction of superdense massive galax- 
ies which survived intact since their formation at 2 > 2 could 
reach 1 - 10%. However, Trujillo et al. (2009) show that the 
actual number density of superdense galaxies at 2 ~ is 
much smaller (the few which do exist appear to be young, 
so they are unlikely to be descendents of the 2 ~ 2 objects). 
Minor mergers must account for the difference. 

Minor mergers are also prefered because constraints 
from the bright end of the luminosity function suggest little 
evolution since 2 ~ 1 (e.g. Wake et al. 2006; Brown et al. 
2007; Cool et al. 2008), even though cluster masses are ex- 
pected to have grown substantially, through mergers, during 
this time (e.g. Sheth & Tormen 1999). Of course, as a re- 
sult of such mergers, the fractional mass growth of the BCG 
need not be the same as that of its cluster, since some of 
the added stellar mass must make the intercluster light (e.g. 
Skibba et al. 2007) . And indeed, comparison of the cluster- 
ing of the most luminous galaxies at 2 ~ 0.7 with that more 
locally suggests that cluster merging has resulted in some 
stellar mass growth of the BCGs (White et al. 2007; Wake 
et al. 2008) ; while massive halos double in mass over the last 
7 Gyr, the stellar masses of their BCGs are expected to have 
changed by about 30% (Brown et al. 2008). 

If this estimate of the mass change is accurate, then ma- 
jor mergers cannot account for the factor of 2 change in size 
which van der Wei et al. (2008) report is typical for massive 
early-types over this time - and which our measurements 
suggest is even more dramatic for BCGs. On the other hand, 
minor mergers are better able to reconcile the observations 
of dramatic size evolution with little mass evolution. Indeed, 
Bournaud et al. (2007) have recently highlighted the fact 
that multiple minor mergers may be the dominant channel 
for early- type galaxy formation. Simulations show that, for 
such a merger, the fractional increase in size can be larger 
than that of the mass (e.g.. Halo A in Naab et al. 2007), 
consistent with our simple analytic estimate above. 

We also compared the ages and sizes of our early-type 
BCGs with other cluster galaxies (satellites). BCGs are 
larger than early-type cluster galaxies of similar luminosity 
or stellar mass at the same redshift (Figure [10)) . Although 
satellites and BCGs trace the same weak age-L or age— A/« 
relation (Figure [12)) . this can be understood by noting that 
BCGs are typically about 1 Gyr older than the satellites in 
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Figure 14. Difference between tlie iuminosity of a satellite galaxy 
and that of its BCG, as a function of BCG luminosity. 

their group (Figure [13]), and they are about 0.5 mags more 
luminous (Figure [2)). 

The mean satellite luminosity is approximately 
independent of BCG luminosity - a prediction 
HSkibba et al. 2006)) which has recently been confirmed from 
measurements in other group catalogs (jSkibba et al. 20071 
[Hansen et al. 2007p . Figure [141 shows that this remains true 
if both BCGs and satellites are required to be early-types. 
Skibba & Sheth (2008) went on to suggest that satellite 
colors should also be much weaker functions of group mass 
than are the colors of centrals, and Skibba (2009) showed 
that this was indeed the case. Since color is an indicator of 
Mt/L, and satellite L is almost independent of group mass, 
M* should be similarly independent. This prediction has 
now been confirmed: van den Bosch et al. (2008) find that 
M« for satellites changes by a factor of only 2 over a range 
where group mass changes by a factor of 100. For similar 
reasons, one expects only a small increase of mean satellite 
age with BCG luminosity; this is qualitatively consistent 
with the constant age offset between satellites and BCGs 
in our Figure 1131 and the weak age-luminosity relation for 
BCGs in our Figure [H 
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APPENDIX A: EFFECT OF THE MAGNITUDE 
LIMIT 

The SDSS is magnitude limited. As a result, care must be 
taken when interpretting redshift-dependent trends. In gen- 
eral, accounting for the selection effect is only straightfor- 
ward for correlations with luminosity; correlations with stel- 
lar mass may be strongly affected - even though M« and L 
are tightly correlated. Things are even more complicated if 
one wishes to study age related trends, since the errors on 
the age estimates are correlated with those on M* , and these 
may be substantial. 

Al Correlations with M* 

To illustrate, the panel on the left of Figure lAll shows the 
Af« — L relation in a number of redshift bins; there is no 
trend with redshift. The standard way of accounting for the 
magnitude limit is to weight each galaxy by the inverse of 
the volume Vmax(i), over which it could have been seen. For 
the Al, — L relation, this weighting does not matter, as all 
galaxies in a given L bin have almost the same weight (the 
weighting matters very much for the L — M* relation!). The 
panel on the right shows the L — M, relation in these same 
bins, when objects have been weighted by V^^^: notice how 
the relation flattens out to constant L at small M«. The 
mass scale on which this bias appears depends on redshift, 
and is purely a consequence of the SDSS magnitude limit - 
the Knax-weighting does not solve this problem. 

Figure lX2l illustrates that this can have a dramatic effect 
on the Re — M« relation if it is measured in a narrow redshift 
bin. The panel on the left shows the Re — L relation in the 
same sequence of narrow redshift bins as before. At small L, 
the Re — L relation is the same in all the redshift bins; at 
high L where there is significant curvature in the relation, 
there is some evidence for evolution. In contrast, the Re — Mt 
relation appears to evolve dramatically, particularly at small 
M*. It is easy to see that this is a selection effect, and that it 
produces dramatic effects even though L and M» are tightly 
correlated. Consider objects in a given narrow redshift bin. 
Because of the magnitude limit, objects which scatter to 
lower L for their M* will be excluded from the sample. The 
observed sample will contain objects with large L for their 
Af, ; since L and M* are strongly correlated, this will be 
more dramatic at small Af*; since size and L are strongly 
correlated, the exclusion of small L objects biases the sample 
to large R at small A/*. 

Since M^/L increases with age, the effect of the magni- 
tude limit is particularly pernicious for studies which include 
both Af, and the age. This is shown in Figure [XS] The left 
hand panel shows that tform, the lookback time from the 
present to when the stars formed, increases slightly with lu- 
minosity. The right hand panel shows the correlation when 
L is replaced with Af*. For Af, smaller than 10^^ Af© the 
relation for the bulk of the population is changed dramat- 
ically. This is because, to make the plot on the right, we 
have shifted each object in the panel on the left by Af,/L. 
However, because we have restricted to a narrow bin in z, 
Mt/L increases with tform, so the shift is larger for large 
iform- Now, at small Af, , the objects with large Af,/L fell 
outside the magnitude limit of the survey {Mr < —21), so 
they are missing from the tform — Af, correlation. Since large 



Evolution of early-type BCGs 15 




Figure Al. Correlation between stellar mass and luminosity (left) and luminosity and stellar mass (right) in a number of narrow redshift 
bins. The flattening at (redshift dependent) small M* in the panel on the right is a selection effect which is due to the magnitude limit 
of the SDSS. 
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Figure A2. Correlation between size and luminosity (left) and stellar mass (right) in a number of narrow redshift bins. The correlation 
with luminosity curves upwards at high luminosities; at low luminosities, the relation is not curved, and is independent of redshift. The 
strongly redshift-dependent curvature at low stellar masses in the panel on the right is a selection effect which is due to the magnitude 
limit of the SDSS. 



Mt/L means large fform, the correlation between tform and 
M* curves sharply downwards as a result. We emphasize 
that this curvature is a selection effect. The satellites and 
BCGs are far enough from the limiting magnitude that they 
are less affected by this bias. 

A2 Correlations with age and M* 

The main text studies the R^ — L and Re — M» correlations 
as a function of the formation time and the age of the stel- 
lar population. However, because the age and M* estimates 
have significant uncertainties, and they are correlated, it is 
important to use ages that are output from the same models 
which estimate M, . 

Although we cannot actually plot the error in the age 
versus the error in M*/L, we expect they will be correlated 
because, for older stellar populations, M*/L oc f"'^^ or so. 



where t is the age of the population. As a result a galaxy 
that is incorrectly assigned a small age will also be assigned 
a small Af, /L ratio. In addition, if the uncertainty on the age 
is small then the uncertainty on M^/ L, and hence M* will 
also be small. This explains the trends shown in the bottom 
panels of Figure 1X41 (note that these do not show the corre- 
lated errors themselves - they show that when one quantity 
has a large error bar, then so does the other). Notice that 
when the estimated age is small, then the uncertainties on 
the age increase dramatically; this increases the uncertainty 
on A/* as well. 

Correlated errors in age and Af*/L complicate analy- 
ses of how galaxy structure correlates with formation time 
and age. To illustrate. Figure IXSl shows A/./L as a func- 
tion of L and Af* for a number of bins in formation time. 
The panel on the left is not very surprising - galaxies which 
formed longer ago have larger Af* / L ratios - although it ap- 
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Figure A3. Correlation between (lookback time to) formation and luminosity (left), and stellar mass (right), for BCGs (squares and 
crosses), satellites (triangles), and the bulk of the population (filled circles with error bars), over the redshift range 0.1 < z < 0.15. The 
drop at small Af* is a selection effect. 



pears that there may be something amiss in the bin with 
the most recent formation times. The offset from one bin 
to another is consistent with the expected fading of an old 
stellar population: M*/L oc t^'''^ or so, where t is the age 
of the population. Figure IX6l shows this explicitly: when the 
luminosity has been corrected for this age effect, then the 
different formation time bins overlap. 

It is worth noting that using age estimates from a dif- 
ferent algorithm than the one which provided the A/*/L 
estimates results in qualitatively similar behaviour to that 
shown in the lefthand panel of Figure IA5I except that the 
offset between the different formation time bins is smaller. 
This is because, if this is done, then the correlated nature of 
the age and M./L errors is missing. As a result, it would be 
possible for an object to be assigned a younger age than its 
true one, as well as a larger A/*/L ratio than its true one, 
and so objects assigned recent formation times would have 
larger Af«/L ratios on average, and objects assigned older 
formation times have smaller M^/L ratios, than when the 
errors in age and M^/ L are correlated. 

Unfortunately, the presence of correlated errors pro- 
duces spurious features in the right hand panel of Figure ESl 
In this case, errors in Af./L move objects along lines that 
slope upwards and to the right. But if an object scatters 
downwards and to the left along such a line (of constant L), 
it is also assigned a younger age, and so it contributes to a 
more recent formation time bin. If there are, in fact, no real 
galaxies having large but recent formation times, then 
the correlated errors will have produced such a population. 
Since the uncertainties are largest for the youngest galax- 
ies, we believe that it is this effect which causes the sharp 
upturn in the lowest two formation time bins. 

A better procedure, if only the age or only Af, is known, 
is to assume that d\og{M, / L) / dlogt — 0.75, and to use 
this to correct luminosities for age effects. Thus, when we 
use 7;,'^°'''' as a proxy for M,, we obtain the correct spread 
in ages at fixed L (solid lines in Figure lASp . and we find 
M^/L""'''' ^ constant (Figure HH). 



APPENDIX B: COMPARISON WITH 
PREVIOUS WORK AT Z < 0.1 

The size-luminosity relation for BCGs has been the 
subject of much recent interest ([Lauer et al. 200T1 
IBernardi et al. 20071 Ivon der Linden et al. 20071 

ILiu et al. 2008^ . Whereas most authors agree that early- 
type BCGs are very different from the bulk of the 
population, von der Linden et al. (2007) find substantially 
smaller differences. This is almost certainly because von 
der Linden et al. use Petrosian-based quantities, and these 
have been compromised by seeing dHyde fc Bernardi 2009] ). 

However, the R — L scaling relation from Bernardi et 
al. (2007) differs slightly from that found in the main text 
above. Figure IBll compares the relation for C4 BCGs re- 
ported by Bernardi et al. (2007) (long dashed line), with our 
present determination (symbols and dashed-triple-dot line) 
for the same BCGs; the Bernardi et al. sizes are slightly 
smaller, and the scaling relation is slightly shallower. We 
have traced this to the fact that, although both sizes come 
from 2d fits to the surface brightness profile, our size is an 
effective circular size {R — \/ab), whereas Bernardi et al. 
inadvertently show the minor axis 6, even though they state 
that they show \/ab (a result of correcting the long axis a 
by two powers of ^Jhja rather than just one). Correcting 
for this effect makes their relation the same as ours. For 
completeness, this relation is 

logio ( ) = 0-158 - 0.423 {Mr + 21). (Bl) 
ykpcy 

However, Figure IB2I compares the scaling we find for 
the C4 BCGs with that reported by Lauer et al. (2007). Al- 
though both relations are clearly more like one another than 
like the bulk of the population, there are significant differ- 
ences. Some of this is due to systematic differences between 
the two reductions, and, in light of the results in the main 
text, some may be due to evolution: the Lauer et al. sam- 
ple is at even lower redshifts than our C4 sample. This, of 
course, does not explain the offset at smaller luminosities. 
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Figure A4. Uncertainties on estimated ages and stellar masses. Dashed line shows the difference between the 84th percentile of likelihood 
distribution of the measured age or stellar mass and the 50th percentile; solid line shows the difference between the 16th and 50th 
percentile. The error on logjQ M» develops a tail which extends to small masses when M* is large (top left), and it increases dramatically 
for ages below 9 Gyrs (top right). The error on the age increases at small M* (middle left) and age (middle right). When the error on 
M« is large, so is the error on the age (bottom left), and vice versa (bottom right). 
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Figure A5. Stellar mass to light ratio as a function of luminosity (left), and M« (right); the slight curvature at the small Af* for each 
bin in tform is due to the same selection effect as in Figure [Xs] The sudden increase in Mt/L at larger M* (right panel) and decrease 
in Mf/L at larger L (left panel) for the younger galaxies is due to correlated errors. Symbols connected by solid and dashed lines show 
the MaxBCGs-DR6spec sample and bulk of the early-type population, respectively. Horizontal solid lines in left panel show the expected 
M*/L given the age: dlog(M*/L)/dlogt = 0.75. 



0.60 
0.50 

0.40 

0.30 

0.20 

0.10 
0.00 



. 7.5 < t; 
. 1 < t! 



[S= < 9.5 Gyrs 
< 1 1 Gyrs 




< 6 Gyrs 

< 7 Gyrs 

< 8 Gyrs 

< 9 Gyrs 

< 10 Gyrs 
< 1 1 Gyrs 



> 1 1 Gyrs 



10.0 



10.5 11.0 
Log,o 



1 1.5 



12.0 



[Ls.„] 



Figure A6. Stellar mass to light ratio as a function of luminosity, where the luminosity has been corrected for age effects assuming 
dlog{Mt / L)/dlogt = 0.75. Here, we corrected the luminosity to a population 6.5 Gyr old, i.e. logj^g LJ;"*"'' = logj^Q Lr + 0.75(logj^g t — 
logj^Q 6.5). Lines as in Figure lASl 
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Figure Bl. Comparison of our measurement of tlie size-luminosity relation in the C4 sample with that reported by Bernardi et al. 
(2007). Our sizes are larger, and the scaling relation steeper, because we set _R = \/ah, where a and b are the major and minor axis 
lengths; Bernardi et al. inadvertantly set R = b. Filled circles show the median value and its uncertainty for the bulk of the early-type 
population; thin and thick black solid lines show the linear and quadratic fits from Table 1 of Hyde & Bernardi (2009), respectively. 
Dashed and dotted curves show the regions which enclose 68% and 95% of the objects. 




Figure B2. Comparison of the size-luminosity and velocity dispersion-luminosity relations (left and right panels) in our data with that 
reported by Lauer et al. (2007). Lauer et al. work with V-band photometry; we assume V-r = 0.36. 



